41 research outputs found

    Assessment of the environmental hazard of mining enterprise waste in the Republic of Khakassia using the biotesting method

    Get PDF
    The results of geochemical analysis of waste samples from mining enterprises of the Republic of Khakassia and of their biotesting are presented. In the biotesting experiment, using Drosophila melanogaster as the test object, the following were assessed relative to the concentration of the sample in the medium: sex ratio, morphoses, pupation height, and mean body and wing length. Conclusions are drawn about the impact of the waste on living organisms, and the chemical elements exerting a toxic effect are identified

    Step fluctuations and random walks

    Full text link
    The probability distribution p(l) for an atom to return to a step at distance l from the detachment site, after a random walk in between, is enumerated exactly. In particular, we study the dependence of p(l) on step roughness, the presence of other reflecting or absorbing steps, the interaction between steps and the diffusing atom, as well as the concentration of defects on the terrace neighbouring the step. Applying Monte Carlo techniques, the time evolution of equilibrium step fluctuations is computed for specific forms of return probabilities. Results are compared to previous theoretical and experimental findings. Comment: 16 pages, 6 figures
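    The return-distance distribution described above can also be estimated numerically. The sketch below is not the paper's exact enumeration; it assumes a square-lattice terrace, a straight absorbing step along x = 0, and a cap on walk length, and simply tallies where the atom reattaches:

    ```python
    import random
    from collections import Counter

    def return_distance_distribution(trials=20000, max_steps=2000, seed=1):
        """Monte Carlo estimate of p(l): an atom detaches from a straight
        step (the line x = 0 on a square lattice), random-walks on the
        terrace, and reattaches at distance l = |y| from the detachment
        site when it first steps back onto x = 0."""
        rng = random.Random(seed)
        counts = Counter()
        for _ in range(trials):
            x, y = 1, 0                      # start on the terrace next to the step
            for _ in range(max_steps):
                dx, dy = rng.choice([(1, 0), (-1, 0), (0, 1), (0, -1)])
                x, y = x + dx, y + dy
                if x == 0:                   # reattached to the step
                    counts[abs(y)] += 1
                    break
        return {l: c / trials for l, c in sorted(counts.items())}

    p = return_distance_distribution()
    ```

    With these parameters nearly all walkers return within the step cap, and the distribution is sharply peaked at l = 0 with a heavy tail, consistent with the qualitative picture of step fluctuations fed by short reattachment hops.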

    Evolutionary discriminative confidence estimation for spoken term detection

    Full text link
    The final publication is available at Springer via http://dx.doi.org/10.1007/s11042-011-0913-z
    Spoken term detection (STD) is the task of searching for occurrences of spoken terms in audio archives. It relies on robust confidence estimation to make a hit/false-alarm (FA) decision. In order to optimize the decision in terms of the STD evaluation metric, the confidence has to be discriminative. Multi-layer perceptrons (MLPs) and support vector machines (SVMs) exhibit good performance in producing discriminative confidence; however, they are severely limited by their continuous objective functions and are therefore less capable of dealing with complex decision tasks. This leads to a substantial performance reduction when detecting out-of-vocabulary (OOV) terms, where the high diversity in term properties usually leads to a complicated decision boundary. In this paper we present a new discriminative confidence estimation approach based on evolutionary discriminant analysis (EDA). Unlike MLPs and SVMs, EDA uses the classification error as its objective function, resulting in a model optimized towards the evaluation metric. In addition, EDA combines heterogeneous projection functions and classification strategies in decision making, leading to a highly flexible classifier that is capable of dealing with complex decision tasks. Finally, the evolutionary strategy of EDA reduces the risk of local minima. We tested the EDA-based confidence with a state-of-the-art phoneme-based STD system on an English meeting-domain corpus; the system employs a phoneme speech recognition system to produce lattices within which the phoneme sequences corresponding to the query terms are searched. The test corpora comprise 11 hours of speech data recorded with individual head-mounted microphones from 30 meetings carried out at several institutes including ICSI; NIST; ISL; LDC; the Virginia Polytechnic Institute and State University; and the University of Edinburgh. 
The experimental results demonstrate that EDA considerably outperforms MLPs and SVMs on both classification and confidence measurement in STD, and the advantage is found to be more significant on OOV terms than on in-vocabulary (INV) terms. In terms of classification performance, EDA achieved an equal error rate (EER) of 11% on OOV terms, compared to 34% and 31% with MLPs and SVMs respectively; for INV terms, an EER of 15% was obtained with EDA compared to 17% with MLPs and SVMs. In terms of STD performance on OOV terms, EDA presented a relative improvement of 1.4% and 2.5% in average term-weighted value (ATWV) over MLPs and SVMs respectively. This work was partially supported by the French Ministry of Industry (Innovative Web call) under contract 09.2.93.0966, ‘Collaborative Annotation for Video Accessibility’ (ACAV) and by ‘The Adaptable Ambient Living Assistant’ (ALIAS) project funded through the joint national Ambient Assisted Living (AAL) programme
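    The ATWV figure quoted above is the standard NIST spoken-term-detection metric. A small sketch of its usual definition (β = 999.9, one non-target trial per second of speech) is given below; the 11 h corpus duration matches the abstract, but the per-term counts are illustrative, not the paper's data:

    ```python
    def atwv(detections, beta=999.9, speech_seconds=39600.0):
        """Average term-weighted value, following the NIST STD-06 definition:
        ATWV = 1 - mean_t [ P_miss(t) + beta * P_fa(t) ],
        with P_miss(t) = 1 - N_correct(t) / N_true(t) and
        P_fa(t)   = N_fa(t) / (speech_seconds - N_true(t)).

        `detections` maps each term to a tuple (n_true, n_correct, n_fa)."""
        total = 0.0
        for n_true, n_correct, n_fa in detections.values():
            p_miss = 1.0 - n_correct / n_true
            p_fa = n_fa / (speech_seconds - n_true)
            total += p_miss + beta * p_fa
        return 1.0 - total / len(detections)

    # Toy example over 11 h (39600 s) of speech: two hypothetical terms.
    score = atwv({"meeting": (10, 8, 3), "lattice": (4, 4, 0)})
    ```

    The large β makes even a handful of false alarms costly, which is why discriminative confidence estimation matters so much for this metric.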

    Speech Processing and Prosody

    Get PDF
    The prosody of the speech signal conveys information beyond the linguistic content of the message: prosody structures the utterance and also carries information on the speaker's attitude and emotion. The prosodic features are the duration of sounds, energy, and fundamental frequency. However, their automatic computation and usage are not obvious. Sound duration features are usually extracted from speech recognition results or from a forced speech-text alignment. Although the resulting segmentation is usually acceptable on clean native speech data, performance degrades on noisy or non-native speech. Many algorithms have been developed for computing the fundamental frequency; they achieve rather good performance on clean speech but, again, performance degrades in noisy conditions. However, in some applications, for example computer-assisted language learning, the reliability of the prosodic features is critical; indeed, the quality of the diagnosis of the learner's pronunciation heavily depends on the precision and reliability of the estimated prosodic parameters. The paper considers the computation of prosodic features, shows the limitations of automatic approaches, and discusses the problem of computing confidence measures on such features. It then discusses the role of prosodic features and how they can be handled for automatic processing in tasks such as the detection of discourse particles, the characterization of emotions, the classification of sentence modalities, computer-assisted language learning, and expressive speech synthesis
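    To illustrate why automatic F0 computation is less straightforward than it looks, here is a bare-bones autocorrelation pitch estimator; real trackers add voicing decisions, cross-frame smoothing, and octave-error handling, all of which this toy omits:

    ```python
    import math

    def estimate_f0(frame, sample_rate, f0_min=50.0, f0_max=400.0):
        """Minimal autocorrelation-based F0 estimator: the strongest
        autocorrelation peak inside the plausible lag range is taken
        as the pitch period."""
        lag_min = int(sample_rate / f0_max)
        lag_max = int(sample_rate / f0_min)
        best_lag, best_r = lag_min, float("-inf")
        for lag in range(lag_min, min(lag_max, len(frame) - 1) + 1):
            r = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
            if r > best_r:
                best_r, best_lag = r, lag
        return sample_rate / best_lag

    # Synthetic voiced frame: a 200 Hz sine sampled at 8 kHz.
    sr = 8000
    frame = [math.sin(2 * math.pi * 200 * n / sr) for n in range(800)]
    f0 = estimate_f0(frame, sr)
    ```

    On this clean synthetic frame the estimate is exact; on real, noisy, or creaky speech the autocorrelation peak structure degrades, which is exactly the reliability problem the paper raises for prosody-dependent applications.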

    Read My Lips: Continuous Signer Independent Weakly Supervised Viseme Recognition

    Full text link
    This work presents a framework to recognise signer-independent mouthings in continuous sign language, with no manual annotations needed. Mouthings represent lip movements that correspond to pronunciations of words or parts of them during signing. Research on sign language recognition has focused extensively on the hands as features. But sign language is multi-modal, and a full understanding, particularly with respect to its lexical variety, language idioms, and grammatical structures, is not possible without further exploring the remaining information channels. To our knowledge no previous work has explored dedicated viseme recognition in the context of sign language recognition. The approach is trained on over 180,000 unlabelled frames and reaches 47.1% precision on the frame level. Generalisation across individuals and the influence of context-dependent visemes are analysed

    Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion

    Get PDF
    The electronic version of this article is the complete one and can be found online at: http://dx.doi.org/10.1186/s13636-015-0063-8
    Spoken term detection (STD) aims at retrieving data from a speech repository given a textual representation of the search term. Nowadays, it is receiving much interest due to the large volume of multimedia information. STD differs from automatic speech recognition (ASR) in that ASR is interested in all the terms/words that appear in the speech data, whereas STD focuses on a selected list of search terms that must be detected within the speech data. This paper presents the systems submitted to the STD ALBAYZIN 2014 evaluation, held as a part of the ALBAYZIN 2014 evaluation campaign within the context of the IberSPEECH 2014 conference. This is the first STD evaluation that deals with the Spanish language. The evaluation consists of retrieving the speech files that contain the search terms, indicating their start and end times within the appropriate speech file, along with a score value that reflects the confidence given to the detection of the search term. The evaluation is conducted on a Spanish spontaneous speech database, which comprises a set of talks from workshops and amounts to about 7 h of speech. We present the database, the evaluation metrics, the systems submitted to the evaluation, the results, and a detailed discussion. Four different research groups took part in the evaluation. Evaluation results show reasonable performance for a moderate out-of-vocabulary term rate. This paper compares the systems submitted to the evaluation and makes a detailed analysis based on several search term properties (term length, in-vocabulary/out-of-vocabulary terms, single-word/multi-word terms, and in-language/foreign terms). This work has been partly supported by project CMC-V2 (TEC2012-37585-C02-01) from the Spanish Ministry of Economy and Competitiveness. 
This research was also funded by the European Regional Development Fund, the Galician Regional Government (GRC2014/024, “Consolidation of Research Units: AtlantTIC Project” CN2012/160)
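    The evaluation protocol described above requires matching each hypothesised detection (with its start/end times) against the reference occurrences. A simplified alignment sketch follows; the actual ALBAYZIN/NIST scoring tools are more elaborate, and the midpoint rule and tolerance value here are assumptions for illustration:

    ```python
    def score_detections(refs, hyps, tolerance=0.5):
        """Align hypothesised detections of one term with its reference
        occurrences. A hypothesis counts as a hit when its midpoint lies
        within `tolerance` seconds of a still-unmatched reference
        interval; anything else is a false alarm. Unmatched references
        are misses. refs and hyps are lists of (start, end) in seconds."""
        matched = [False] * len(refs)
        hits, false_alarms = 0, 0
        for h_start, h_end in hyps:
            mid = (h_start + h_end) / 2.0
            for i, (r_start, r_end) in enumerate(refs):
                if not matched[i] and r_start - tolerance <= mid <= r_end + tolerance:
                    matched[i] = True
                    hits += 1
                    break
            else:
                false_alarms += 1
        misses = matched.count(False)
        return hits, misses, false_alarms
    ```

    The per-term hit/miss/false-alarm counts produced this way are what metrics such as ATWV are computed from.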

    Getting Past the Language Gap: Innovations in Machine Translation

    Get PDF
    In this chapter, we review state-of-the-art machine translation systems and discuss innovative methods for machine translation, highlighting the most promising techniques and applications. Machine translation (MT) has benefited from a revitalization in the last 10 years or so, after a period of relatively slow activity. In 2005 the field received a jumpstart when a powerful, complete experimental package for building MT systems from scratch became freely available as a result of the unified efforts of the MOSES international consortium. Around the same time, hierarchical methods were introduced by Chinese researchers, which allowed the introduction and use of syntactic information in translation modeling. Furthermore, advances in the related field of computational linguistics, making off-the-shelf taggers and parsers readily available, helped give MT an additional boost. Yet there is still more progress to be made. For example, MT will be enhanced greatly when both syntax and semantics are on board; this still presents a major challenge, though many advanced research groups are currently pursuing ways to meet it head-on. The next generation of MT will consist of a collection of hybrid systems. The outlook also augurs well for the mobile environment, as more advanced and improved speech recognition and speech synthesis technologies will enable speech-to-speech machine translation on hand-held devices. We review all of these developments and point out in the final section some of the most promising research avenues for the future of MT

    Multigram-based grapheme-to-phoneme conversion for LVCSR

    No full text

    Investigations on Joint-Multigram Models for Grapheme-to-Phoneme Conversion

    Get PDF
    We present a fully data-driven, language independent way of building a grapheme-to-phoneme converter. We apply the joint-multigram approach to the alignment problem and use standard language modelling techniques to model transcription probabilities. We study model parameters, training procedures and effects of corpus size in detail. Experiments were conducted on English and German pronunciation lexica. Our proposed training scheme performs better than previously published ones. Phoneme error rates as low as 3.98% for English and 0.51% for German were achieved
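    The joint-multigram idea can be illustrated with a toy "graphone" bigram model over (grapheme chunk, phoneme chunk) pairs. Real systems learn the segmentation itself with EM and use higher-order smoothed language models; here the alignments are assumed given, and the example graphones below are invented for illustration:

    ```python
    import math
    from collections import Counter

    def train_graphone_bigram(sequences, alpha=0.1):
        """Train an add-alpha-smoothed bigram model over graphone tokens,
        i.e. joint (grapheme-chunk, phoneme-chunk) units. Each training
        item is a pre-aligned sequence of such pairs. Returns a scoring
        function that gives the log-probability of a graphone sequence."""
        unigrams, bigrams = Counter(), Counter()
        for seq in sequences:
            tokens = [("<s>", "<s>")] + list(seq)
            for prev, cur in zip(tokens, tokens[1:]):
                unigrams[prev] += 1
                bigrams[(prev, cur)] += 1
        vocab = len(unigrams) + 1

        def log_prob(seq):
            tokens = [("<s>", "<s>")] + list(seq)
            lp = 0.0
            for prev, cur in zip(tokens, tokens[1:]):
                lp += math.log((bigrams[(prev, cur)] + alpha) /
                               (unigrams[prev] + alpha * vocab))
            return lp

        return log_prob

    # Hypothetical graphone segmentations of "phone" and "photo".
    train = [
        [("ph", "f"), ("o", "oU"), ("ne", "n")],
        [("ph", "f"), ("o", "oU"), ("to", "t oU")],
    ]
    score = train_graphone_bigram(train)
    good = score([("ph", "f"), ("o", "oU"), ("ne", "n")])
    bad = score([("ne", "n"), ("o", "oU"), ("ph", "f")])
    ```

    Because transcription probability is modelled jointly over spelling and pronunciation chunks, a plausible segmentation scores higher than an implausible ordering of the same units, which is the core of the approach the abstract describes.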